Active learning strategies for atomic cluster expansion models
نویسندگان
چکیده
The atomic cluster expansion (ACE) was proposed recently as a new class of data-driven interatomic potentials with formally complete basis set. Since the development any potential requires careful selection training data and thorough validation, an automation construction dataset well indication model's uncertainty are highly desirable. In this work, we compare performance two approaches for ACE models based on D-optimality criterion ensemble learning. While both show comparable predictions, extrapolation grade (MaxVol algorithm) is more computationally efficient. addition, indicator enables active exploration structures, opening way to automated discovery rare-event configurations. We demonstrate that learning also applicable explore local environments from large-scale molecular-dynamics simulations.
منابع مشابه
Monitoring atomic cluster expansion by high-harmonic generation.
High-harmonic generation is shown to be capable of providing time-resolved information about the particle density of a complex system. As an example, we study numerically high-harmonic generation from expanding xenon clusters in a pump-probe laser scheme, where the pump laser pulse induces the cluster explosion and the probe pulse generates harmonics in the expanding cluster. We show that the h...
متن کاملPLAL: Cluster-based active learning
We investigate the label complexity of active learning under some smoothness assumptions on the data-generating process. We propose a procedure, PLAL, for “activising” passive, sample-based learners. The procedure takes an unlabeled sample, queries the labels of some of its members, and outputs a full labeling of that sample. Assuming the data satisfies “Probabilistic Lipschitzness”, a notion o...
متن کاملActive Learning for Pipeline Models
For many machine learning solutions to complex applications, there are significant performance advantages to decomposing the overall task into several simpler sequential stages, commonly referred to as a pipeline model. Typically, such scenarios are also characterized by high sample complexity, motivating the study of active learning for these situations. While most active learning research exa...
متن کاملActive Learning for Misspecified Models
Active learning is the problem in supervised learning to design the locations of training input points so that the generalization error is minimized. Existing active learning methods often assume that the model used for learning is correctly specified, i.e., the learning target function can be expressed by the model at hand. In many practical situations, however, this assumption may not be fulf...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Physical Review Materials
سال: 2023
ISSN: ['2476-0455', '2475-9953']
DOI: https://doi.org/10.1103/physrevmaterials.7.043801